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Abstract 

The subsumption problem with respect to terminologies in the description logic ACC is 
EXP-complete. We investigate the computational complexity of fragments of this problem 
by means of allowed Boolean operators. Hereto we make use of the notion of clones in the 
context of Post's lattice. Furthermore we consider all four possible quantifier combinations 
for each fragment parameterized by a clone. We will see that depending on what quantifiers 
are available the classification will be either tripartite or a quartering. 

1 Introduction 

Description logics (DL) play an important role in several areas of research, e.g., semantic web, 
database structuring, or medical ontologies [5, 9, 10, 23]. As a consequence there exists a vast 
range of different extensions where each of them is highly specialized to its field of application. 
Nardi and Brachman describe subsumption as the important inference problem within DL [24]. 
Given two (w.l.o.g. atomic) concepts C, D and a set of axioms (which are pairs of concept 
expressions A, B stating A implies B), one asks the question whether C implies D is consistent 
with respect to each model satisfying all of the given axioms. Although the computational 
complexity of subsumption in general can be between tractable and EXP [16, 17] depending 
on which feature 1 is available, there exist nonetheless many DL which provide a tractable (i.e., 
in P) subsumption reasoning problem, e.g., the DL-Lite and EC families [1, 2, 3, 4, 12]. One 
very prominent application example of the subsumption problem is the SNOMED CT clinical 
database which includes about 400.000 axioms and is a subset of the DL £ C ++ [13, 27]. 

In this paper we investigate the subsumption problem with respect to the most general (in 
sense of available Boolean operators) description logic ACC. It is known that the unrestricted 
version of this problem is EXP-complete due to reducibility to a specific DL satisfiability problem 
[5], and is therefore highly intractable. Our aim is to understand where this intractability comes 
from or to which Boolean operator it may be connected to. Therefore we will make use of 
the well understood and much used algebraic tool, Post's lattice [26]. At this approach one 
describes sets formulae (w.r.t. to some clone) which can be constructed from a given finite set 
of Boolean functions by arbitrary composition of these functions, or projections. For a good 
introduction into this area consider [8]. The main technique is to investigate fragments of a 
specific decision problem by means of allowed Boolean functions; in this paper this will be the 
subsumption problem. As Post's lattice considers any possible set of all Boolean functions a 
classification by it always yields an exhaustive study. This kind of research has been done 
previously for several different kind of logics, e.g., temporal, hybrid, modal, and nonmonotonic 
logics [6, 14, 15, 18, 28, 29]. 

Main results. The most general class of fragments, i.e., those which have both quantifiers 
available perfectly show how powerful the subsumption problem is. Having access to at least one 
constant (true or false) leads to an intractable fragment. Merely for the fragment where only pro- 
jections (and none of the constants) are present it is not clear if there can be a polynomial time 
algorithm for this case and has been left open. If one considers the cases where only one quantifier 

1 These features, for instance, can be existential or universal restrictions, availability of disjunction, conjunc- 
tion, or negation. Furthermore, one may extend a description logic with more powerful concepts, e.g., number 
restrictions, role chains, or epistemic operators [5]. 
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Table 1: All clones and bases relevant for this paper. 



is present, then the fragments around disjunction (case V), respectively, the ones around conjunc- 
tion (case 3) become tractable. Without quantifiers conjunctive and disjunctive fragments are 
P-complete whereas the fragments which include either the affine functions (exclusive or) , or can 
express x V (y A z), or x A (y V z), or self-dual functions (i.e., f(x\, . . . , x n ) = -if(-ixi, . . . , ^i„)) 
are intractable. Figure 1 depicts how the results directly arrange within Post's lattice. 

2 Preliminaries 

In this paper we will make use of standard notions of complexity theory [25]. In particular, we 
work with the classes NLOGSPACE, P, coNP, EXP, and the class ©LOGSPACE which corresponds 
to those nondeterministic Turing machine running in logarithmic space whose computations 
trees have an odd number of accepting paths. Usually all stated reductions are logarithmic 
space many-one reductions We write A B iff A <£s b and B <£ s A hold. 

Post's Lattice. Let T, _L denote the truth values true, false. Given a finite set of Boolean 
functions B, we say the clone of B contains all compositions of functions in B plus all projections; 
the smallest such clone is denoted with [B] and the set B is called a base of [B] . The lattice of 
all clones has been established in [26] and a much more succinct introduction can be found in [8] . 
Table 1 depicts all clones and their bases which are relevant for this paper. Here maj denotes 
the majority, and id denotes identity. Let /: {T,_L}™ — > {T,_L} be a Boolean function. Then 
the dual of f, in symbols dual(/), is the n-ary function g with g(x\, . . . , x n ) = f(x\, . . . , x^). 
Similarly, if B is a set of Boolean functions, then dual(B) := {dual(J) | / € B}. Further, 
abusing notation, define dual(3) := V and dual(V) = 3; if Q C {3,V} then dual(Q) := 
{dual(D) | D e Q}. 

Description Logic. We use the standard syntax and semantics of ACC as in [5]. Additionally 
we we adjusted them to fit the notion of clones. The set of concept descriptions (or concepts) is 
defined by C :— A \ o/(C, . . . , C) \ 3R.C | Vi?.C, where A is an atomic concept (variable), R is a 
role (transition relation) , and Of is a Boolean operator which corresponds to a Boolean function 
/: {T,_L}" — > {T,_L}. For a given set B of Boolean operators and Q C {3,V}, we define that 
a B-Q-concept uses only operators from B and quantifiers from Q. Hence, if B — {A,V} then 
[B] = BF, and the set of S-concept description is equivalent to (full) ACC. Otherwise if [B] C BF 
for some set B, then we consider real subsets of ACC and cannot express any (usually in ACC 
available) concept. An axiom is of the form CCD, where C and D are concepts; C = D is 
the syntacic sugar for C C D and D C C. A TBox is a finite set of axioms and a B-Q-TBox 
contains only axioms of B-Q-concepts. 

An interpretation is a pair I = (A 1 , - 1 ), where A 1 is a nonempty set and - x is a mapping 
from the set of atomic concepts to the power set of A 1 , and from the set of roles to the power 
set of A 1 x A 1 . We extend this mapping to arbitrary concepts as follows: 

(3R.C) 1 = {xeA x \{yeC x \ (x,y) e R 1 } ^ 0}, 
(VR.Cf - {x e A 1 1 {y e C 1 I (x,y) £ R 1 } = 0}, 
( o f (C u . . . , C n )f - {x e A 1 1 f(\\x e Cf ||, . . . , \\x e C x \\) = T}, 
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where ||a; G C x \ \ = 1 if x € C 1 and ||x G C" 27 1 1 = if x ^ C 1 . An interpretation I satisfies the 
axiom C C D, in symbols I |= C C £), if C 1 C Z) 1 . Further I satisfies a TBox, in symbols 
1 1= T, if it satisfies every axiom therein; then X is called a model. Let Q C {3,V} and B be a 
finite set of Boolean operators. Then for the TBox-concept satisfiability problem, TCSATg(_B), 
given a B-Q-TBox T and a -B-Q-concept C, one asks if there is an X s.t. I (= T and C 1 7^ 0. This 
problem has been fully classified w.r.t. Post's lattice in [21]. Further the Subsumption problem, 
SUBSg(£>), given a S-Q-TBox and two B-Q-concepts C,D, asks if for every interpretation X 
it holds that X\=T implies C 1 C D 1 . 

As subsumption is an inference problem within DL some kind of connection in terms of reduc- 
tions to propositional implication is not devious. In [7] Beyersdorff et al. classify the propositional 
implication problem IMP with respect to all fragments parameterized by all Boolean clones. 

Theorem 1 ([7]) Let B be a finite set of Boolean operators. 

1. IfCC [B] for C e {S 00 , D 2 ,Sio} ; then IMP(B) is coNP-complete w.r.t. <^ c ° 2 . 

2- If\- 2 C [B] C L, then IMP(B) is ®LOGSPACE-comp/ete w.r.t. <^ c °. 

3. J/N 2 C [B] C N, then IMP(B) is in AC [2]. 

^. Otherwise IMP(_B) G AC . 



3 Interreducibilities 

The next lemma proves base independence for the subsumption problem. This is important to 
generalize the results to clones corresponding to their respective base. In particular, this kind 
of property enables us to use standard bases for every clone within our proofs. The result is 
proven in the same way as in [22, Lemma 4]. 

Lemma 1 Let Bi,B 2 be two sets of Boolean operators such that [Bi] C [B 2 \, and let Q C {3, V}. 
Then SUBS s (Bi) <£s SUBS S (B 2 ). 

The following two lemmata deal with a duality principle of subsumption. The correctness of 
contraposition for axioms allows us to state reduction to the fragment parameterized by the dual 
operators. Further having access to negation allows us in the same way as in [20] to simulate 
both constants. 

Lemma 2 Let B be a finite set of Boolean operators and Q C {V, 3}. Then SUBSg(B) <j£ s 
SUBS dual(Q) (dual(i?)). 

Proof Here we distinguish two cases. Given a concept A define with the concept ~^A in 
negation normal form (NNF). 

First assume that -. G [B]. Then (T,C,D) G SUBS Q (_B) if and only if for any inter- 
pretation X s.t. X |= T it holds that C 1 C D 1 if and only if for any interpretation X s.t. 
1 1= V := {F^ C £T I E E F e T} it holds that (^D) x C (^C) 1 if and only if (T, D^,CT) G 
SUBSduai(Q)(d ua l(-B))- The correctness directly follows from dual(^) = -1. 

Now assume that -1 ^ [B]. Then for a given instance (T, C, -D) it holds that for the con- 
traposition instance ({F^ C E~~ ECfe T},-D^,C^) before every atomic concept occurs a 
negation symbol. Denote with ({F~~ n E~~ \ E n F £ T}, D~~ ,C~~) pos the substitution of any 
such negated atomic concept ^A by a fresh concept name A' . Then (T, C, D) G SUBSg(-B) iff 
{{F- C £T \EQF G T}, JT , CT)p° s G SUBS dual(s) (dual(B)). ~ , 

Lemma 3 Let B be a finite set of Boolean operators s.t. N 2 C [B] and Q C {3,V}. TTien if 
ZioZds that SUBS C (B) =JS g SUBS S (5U{T,_L}). 

Using Lemma 4.2 in [7] we can easily obtain the ability to express the constant T whenever 
we have access to conjunctions, and the constant _L whenever we are able to use disjunctions. 

2 A language A is AC many-one reducible to a language B (A <^f° B) if there exists a function / computable 
by a logtime-uniform AC°-circuit familiy such that x 6 A iff f{x) 6 B (for more information, see [31]). 
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Lemma 4 Let B be a finite set of Boolean operators and Q C {V, 3}. 

1. If Eq C [B] , then SUBS S (B) =j£ s SUBSg(B U {T}). 

2. //Vo C [B], tfcen SUBSg(B) =£s SUBS Q (B U {!_}). 

The connection of subsumption to terminology satisfiability and propositional implication is 
crucial for stating upper and lower bound results. The next lemma connects subsumption to 
TCSAT and also to IMP. 

Lemma 5 Let B be a finite set of Boolean operators and Q C {V, 3} be a set of quantifiers. 
Then the following reductions hold: 

1. IMP(B) <£s SUBS (B). 

2. SUBS Q (B) <£s TCSAT Q (BU{^}). 

3. TCSAT Q (B) SUBS Q (B U {_L}). 

Proof 1. Holds due to (») e IMP(B) iff (C V ,C^,<D) G SUBS (B), for concept descrip- 
tions C v — f{ip),C^ = f(ip) with / mapping propositional formulae to concept descrip- 
tions via 

/(T) = T, and /(!_) = J_, 
f(x) — C x , for variable x, 
f(g(C 1 ,...,C n ))=o g (f(C 1 ),...,f(C n )) 

where g is an n-ary Boolean function and o g is the corresponding operator. 

2. (C,D,T) G SUBS Q (B) iff (T, C^L>) e TCSAT Q (B U [5]. 

3. (T, C) G TCSAT Q (B) iff (C, _L, T) G SUBS Q (B U {_L}). [5]. ■ 

4 Main Results 

We will start with the subsumption problem using no quantifiers and will show that the problem 
cither is coNP-, P-, NLOGSPACE-completc, or is eLOGSPACE-hard. 

Theorem 2 (No quantifiers available.) Let B be a finite set of Boolean operators. 

1. IfX C [B] for X e {Lo,Li,L3,Sio,Soo,D 2 } } then SUBS (B) is coNP -complete. 

2- If E 2 C [B] C E or V 2 C [B] C V, tften SUBS (B) is P -complete. 

3. If[B] = L 2 , then SUBS (B) is ®LOGSPACE-/iarrf. 

4- Ifh C [B] C N, tfien SUBS (B) is NLOGSPACE-compZete. 
All hardness results hold w.r.t. <|£ s reductions. 

Proof 1. The reduction from the implication problem IMP(B) in Lemma 5(1.) in combi- 
nation with Theorem 1 and Lemma 1 proves the coNP lower bounds of Si , S o, D 2 . The 
lower bounds for L C [B] and L 3 C [B] follow from Lemma 5(3.) with TCSAT (B) being 
coNP-complete which follows from the N P-completeness result of TCSAT (B) shown in 
[22, Theorem 27]. Further the lower bound for l_i C [B] follows from the duality of '©' 
and '=' and Lemma 2 with respect to the case L C [B] enables us to state the reduction 

SUBS (L O ) < l ° e SUBS dual(0) (dual(L o )) = SUBS (l_i). 



The upper bound follows from a reduction to TCSAT (BF) by Lemma 5(2.) and the 
membership of TCSAT (BF) in NP by [22, Theorem 27]. 
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2. The upper bound follows from the memberships in P for SUBS^E) and SUBSy(V) proven 
in Theorems 3 and 4. 

The lower bound for [B] = E 2 follows from a reduction from the hypergraph accessibility 
problem 3 HGAP: set T = {u\ n u 2 C v \ (ui,U2; v) G E}, assume w.l.o.g. the set of source 
nodes as S = {s}, then (G, S, t) G HGAP iff {T,s,t) G SUBS (E 2 ). For the lower bound 
of V 2 apply Lemma 2. 

3. Follows directly by the reduction from IMP(I_2) due to Theorem 1 and Lemma 5(1.). 

4. For the lower bound we show a reduction from the graph accessibility problem 4 GAP 
to SUBS0(l 2 ). Let G — {V,E) be a undirected graph and s,t G V be the vertices for 
the input. Then for T := {{A u C A v ) \ (u,v) G E} it holds that (G,s,t) G GAP iff 
(T, A s , A t ) G SUBS (I 2 ). 

For the upper bound we follow the idea from [22, Lemma 29]. Given the input instance 
(T, C, D) we can similarly assume that for each E C F G T it holds that -E 1 , F are atomic 
concepts, or their negations, or constants. Now (T,C,D) G SUBS0(N) holds iff for every 
interpretation I — (A 1 , - 1 ) and x G A 1 it holds that if x G C 1 then x G -D 1 holds iff for 
the implication graph Gj- (constructed as in [22, Lemma 29]) there exists a path from vq 
to vd- 

Informally if there is no path from vc to then D is not implied by C, i.e., it is possible 
to construct an interpretation for which there exists an individual which is a member of 
C x but not of D 1 . 

Thus we have provided a coNLOGSPACE-algorithm which first checks accordingly to the 
algorithm in [22, Lemma 29] if there are not any cycles containing contradictory axioms. 
Then we verify that there is no path from vc to implying that C is not subsumed by 
D. ■ 

Using some results from the previous theorem we are now able to classify most fragments 
of the subsumption problem using only either the V or 3 quantifier with respect to all possible 
Boolean clones in the following two theorems. 

Theorem 3 (Universal fragments) Let B be a finite set of Boolean operators. 
1- If £2 C [B], then SUBSv(S) is EXP-complete. 
2. 7/N 2 C [B] or L C [B], then SUBS V (S) is EXP-complete. 

3- If Li C [B], then SUBS V (B) is EKP -complete. 

4- IfSoo C [B], then SUBS V (B) is EXP-complete. 

5. IfD 2 C [B] C Di, then SUBS V (B) is coNP-hard and in EXP. 

6. If[B] C V, then SUBS V (B) is P-complete. 

7. If [B] = L 2; then SUBS V (B) is P-hard and in EXP. 
All hardness results hold w.r.t. <j£ s reductions. 

Proof 1. Follows from EXP-hardness of JTo-SUBS which has been shown in [19, Thm 7.6]. 
2. The lower bound for N 2 C [B] is achieved through the reductions 

TCSAT V (N 2 ) <^ SUBSy(N) =^ SUBS V (N 2 ), 

3 In a given hypergraph H = (V,E), a hyperedge e £ E is a pair of source nodes •src(e) £ V X V and one 
destination node dest(e) £ V. Instances of HGAP consist of a directed hypergraph H = (V, E), a set S C V of 
source nodes, and a target node t £ V. Now the question is whether there exists a hyperpath from the set S 
to the node t, i.e., whether there arc hypcredges ei, e2, . . . , such that, for each ej, there are ei x , . . . , ej„ with 
1 < ii, . . . ,i v < i and Ujefjj i v } dest{ej) U src(ej) D src(ei), and src(ei) = 5 and dest{ek) = t [30]. 

4 Instances of GAP are directed graphs G together with two nodes s, t in G asking whether there is a path 
from s to t in G. 
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where the first reduction is due to Lemma 5(3.) and the second equivalence holds through 
Lemma 3 which enables us to always have access to both constants whenever N2 C [B]. The 
EXP-hardness now follows from TCSATv(N2) being EXP-completc proven in [22, Theorem 
32(1.)]. 

The EXP-hardness for l_o C [B] follows from Lemma 5(3.) which states the reduction 
TCSATv(L ) SUBS V (L U {!_}) where [L U {!_}] = L . From [22, Theorem 32 (1.)] 
we know that TCSAT V (L ) is EXP-complete. 

3. The EXP-hardness follows from the following reduction: 

(«) (*>) 

TCSAT 3 (L ) <£s SUBS 3 (L ) SUBS dual(3) (dual(L )) = SUBSvtLr), 

by virtue of TCSAT 3 (L ) being EXP-complete shown in [22, Theorem 32 (1.)] for (a), and 
Lemma 2 for (6). 

4. Follows from Lemma 4 and the EXP-hardness of SUBSy(Mo) overlaid by Theorem 3(1.), 
as M = [S00U {!_}]. 

5. The coNP-hardness follows from SUBS0(D 2 ) being coNP-hard shown in Theorem 2. 

6. For the upper bound Lemma 2 lets us state the reduction SUBSy(V) <j£ g SUBSg(E), 
where the latter is in P by virtue of Theorem 4(6.). 

The lower bound follows again from Lemma 2, and the P-hardness of SUBS 3 (I 2 ) which is 
proven in [22, Lemma 26]. 

7. The P-hardness follows from(6.). ■ 

Theorem 4 (Existential fragments) Let B be a finite set of Boolean operators. 

1. J/V 2 C [B], then SUBS 3 (B) is EXP-complete. 

2. J/N 2 C [B] or L C [B], then SUBS 3 ( J B) is EXP-complete. 
3- If Li C [B], then SUBS 3 (B) is EXP-complete. 

I IfS w C [B], then SUBS 3 (B) is EXP-complete. 

5. IfD 2 C [B] C D l7 then SUBS 3 (B) is coNP-hard and in EXP. 

6. If[B] C E, then SUBS 3 (B) is P -complete. 

7. If [B] = L 2; then SUBS 3 (B) is P-hard and in EXP. 
All hardness results hold w.r.t. <]£ s reductions. 

Proof 1.-3. For the following reductions showing the needed lower bounds for V 2 , IM 2 , l_i, and 
L we use Theorem 3 in combination with the contraposition argument in Lemma 2: 

SUBS V (E 2 ) < l °z SUBS 3 (V 2 ), SUBS V (N 2 ) <£s SUBS 3 (N 2 ), 

SUBSv(Lq) <^ s SUBS 3 (L!), and SUBSvtU) <^ g SUBS 3 (L ). 

4. The needed lower bound follows from Lemma 4 whereas the EXP-hardness of SUBS^M^ 
overlaid by Theorem 4(1.) as Mi = [S10 U {T}]. 

5. The coNP lower bound follows from SUBS0(£?) shown in Theorem 2. 

6. The upper bound follows from the membership of subsumption for the logic £CH in P, 
[11, Thm. 9]. The lower bound is proven in [22, Lemma 26]. 

7. The lower bound follows from Theorem 4(6.). ■ 

Finally the classification of the full quantifier fragments naturally emerges from the previous 
cases to EXP-complete, coNP-, and P-hard cases. 
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Theorem 5 (Both quantifiers available) Let B be a finite set of Boolean operators. 

1. Let X G {N 2 , V 2 , E 2 }. IfXC [B], then SUBS 3V (B) is EXP -complete. 
2- If\ C [B] or li C [B], then SUBS 3V (B) is EXP -complete. 

3. Lf D 2 C [B] C Di, i/ien SUBS 3V (B) is coNP-/iard and in EXP. 
^. 7/ [5] e {l 2 , L 2 } ; t/jen SUBS 3V (B) is P-hard and in EXP. 
All hardness results hold w.r.t. <j£ s reductions. 

Proof 1. Follows from the respective lower bounds of SUBS 3 (7?), resp., SUBSv(7?) shown 
in Theorems 3 and 4. 

2. The needed lower bound follows from Lemma 5(3.) and enables a reduction from the EXP- 
complcte problem TCSAT 3V (I ) [21, Theorem 2 (1.)]. The case SUBS 3V (B) with L C [B] 
follows from the contraposition argument in Lemma 2. 

3. +4. The lower bounds carry over from SUBSET?) for the respective sets B (see Theorem 2).> 



5 Conclusion and Discussion 

Figure 1 visualizes how the results arrange in Post's lattice. The classification has shown that the 
subsumption problem with both quantifiers is a very difficult problem. Even a restriction down 
to only one of the constants leads to an intractable fragment with EXP-completeness. Although 
we achieved a P lower bound for the case without any constants, i.e., the clone l 2 it is not clear 
how to state a polynomial time algorithm for this case: We believe that the size of satisfying 
interpretations always can be polynomially in the size of the given TBox but a deterministic 
way to construct it is not obvious to us. If one starts with an individual instantiating the given 
concept C then it is not easy to decide how to translate a triggered axiom into the interpretation 
(e.g., should a role edge be a loop or not). Further it is much harder to construct such an 
algorithm for the case L 2 having a ternary exclusive-or operator. 

Retrospectively the subsumption problem is much harder than the the usual terminology 
satisfiability problems visited in [21]. Due to the duality principle expressed by Lemma 2 both 
halves of Post's lattice contain intractable fragments plus it is not clear if there is a tractable 
fragment at all. For the fragments having access to only one of the quantifiers the clones 
which are able to express either disjunction (for the universal quantifier) or conjunction (for the 
existential case) become tractable (plus both constants). Without any quantifier allowed the 
problem almost behaves as the propositional implication problem with respect to tractability. 
The only exception of this rule are the L-cases that can express negation or at least one constant. 
They become coNP-complete and therewith intractable. 

Finally a similar systematic study of the subsumption problem for concepts (without respect 
to a terminology) would be of great interest because of the close relation to the implication 
problem of modal formulae. To the best of the author's knowledge such a study has not been 
done yet and would enrich the overall picture of the complexity situation in this area of research. 
Furthermore it would be interesting to study the effects of several restrictions on terminologies 
to our classification, e.g., acyclic or cyclic TBoxes. 

Acknowledgements The author thanks Thomas Schneider (Bremen) and Peter Lohmann 
(Hannover) for several helpful discussions about the paper. 
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Figure 1: Post's lattice showing the complexity of SUBSg(B) for all sets C Q C {3,V} and 
all Boolean clones [B]. 
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